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Abstract 

We explore the structure of the p-adic automorphism group T of the infinite rooted 
regular tree. We determine the asymptotic order of a typical element, answering an 
old question of Turan. 

We initiate the study of a general dimension theory of groups acting on rooted trees. 
We describe the relationship between dimension and other properties of groups such 
as solvability, existence of dense free subgroups and the normal subgroup structure. 
We show that subgroups of T generated by three random elements are full-dimensional 
and that there exist finitely generated subgroups of arbitrary dimension. Specifically, 
our results solve an open problem of Shalev and answer a question of Sidki. 

1 Introduction 

Let T = T{d) denote the infinite rooted d-aij tree and let H C Sym((i) be a permutation 
group. Let T{H) denote the infinite iterated wreath product of H acting on T with respect 
to H. For example, r(Sym((i)) is the full automorphism group of T{d). Let r„(if) denote 
the n-fold wreath product of H, acting on T„, the ci-ary tree of depth n. 

The case H = Cp, the cyclic group of order p is of particular interest. The pro-p group 
r(p) = r(Cp) obtained this way is called the group of p-adic automorphisms. The group 
Tn{p) is called the symmetric p-group of depth n, as it can also be obtained as the Sylow 
p-subgroup of the symmetric group Sym(p"'). 

The first goal of this paper is to analyze the elements of T{H). We answer a question 



of Turan fsee iPa. 



Erdos and Turan 



a. 



fy and Szalav ( 1983| )). which asks for an analogy of the famous theorem of 
()l965h about the distribution of orders of random elements in Sym(n). 
Let p^" denote the order of a random element of the symmetric p-group r„(p) and let 
be the solution of the equation a{l — a)^/"^^ = 1 — 1/p in the open unit interval. 



Theorem 1. We have Kn/n Op in probability. 
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A similar result holds for general H (see Section |2I). The proof of this theorem depends 
on a new measure-preserving bijection between conjugacy classes of random elements and 
Galton- Watson trees, bringing the theory of stochastic processes to bear upon the subject. 
Theorem [T] was conjectured bv IPalfv and Szalavl ( 19831): thev proved the upper bound and 
that the variance of Kn remains bounded as n — » oo. IPuchtal ( 2001 ) showed that the limit 



in Theorem ^ exists. In a related recent paper, EvansI (|2002[ ) studies the random measure 
given by the eigenvalues of the natural representation of r„(p). 

The next goal is to understand the subgroup structure of T{j>). Since every countably 
based pro-p group can be embedded into r(p), we investigate subgroups according to their 
Hausdorff dimension. We show that the Hausdorff spectrum for the closure of subgroups 
generated by 3 elements is the whole unit interval. More precisely. 

Theorem 2. For each A G [0, 1] there exists a subgroup G C T{p) generated by 3 elements 
such that the closure of G is X- dimensional. 



This result leads to the solution of a problem of Shalev ( 2000l ). who asked whether a 
finitely generated pro-p group can contain finitely generated subgroups of irrational Hausdorff 
dimension. 

In the context of pro-finite grou ps Hausdorff dimens i on wa s introduc ed by Abercrombid 
( I994I). It wa s deeply analyzed by Barnea and Shalev ( 1997i ) (see also Shalev ( 2000[ ) and 



Barnea et al, 



(|l998i) ). They show that for closed subgroups, Hausdorff dimension agrees 
with the lower Minkowski (box) dimension. In the case of T{H), this can be computed from 
the size of the congruence quotients Gn = GT*^"Yr*^"\ where F*^"-' denotes the stabilizer of 
level n in F. In fact, dimnC is given by the liminf of the density sequence 



7„(G) =log|G„|/log|F, 



(1) 



For instance, the closure of Grigorchuk's group, a subgroup of F(2), has dimension 5/8. 

Theorem |21 is obtained using proba bilistic methods. These methods give the following 
analogue of the theorem of lDixonI (|l969l ) saying that two random elements of Sym(n) generate 



Sym(n) or the alternating group Alt(?2) with probability tending to 1. Perhaps surprisingly, 
no deterministic construction is known for large subgroups of F„(p) with bounded number 
of generators as in the following theorem. 

Theorem 3. Let e > 0. Let Gn be the subgroup generated by three random elements of the 
symmetric p-group F„(p). Then P ( IG^I > |F„(p)|-'^^^) ^ 1 as n 00. 



For the infinite group F(p), this takes the following form: with probability 1, three random 
elements generate a subgroup with 1-dimensional closure (see Theorem 17. 2j) . We conjecture 
that the same holds for two elements instead of three. Since three random elements generate 
a spherically transitive subgroup with positive probability, and F(p) is not topologically 
finitely generated, the above result shows the existence of a rich family of topologically 
finitely generated transitive full-dimensional subgroups of F(p). 



Th e analysis of randomly generated subgroups allows us to answer a question of ISidki 
()200l[ ). He asked whether the binary adding machine, an element that acts on every level of 
T(2) as a full cycle, can be embedded into a free subgroup of F(2). Call a subgroup G C T{H) 
strongly free if it is free and every nontrivial element fixes only finitely many vertices. For 
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instance, both the trivial group and the cychc subgroup generated by the adding machine 
have this property. 

Theorem 4. Let G C T{H) be a strongly free subgroup and let g G r(if) be a random 
element. Then the group {G,g) is strongly free with probability 1. 

Next, we explore a general dimension theory of groups acting on the rooted trees T(p). 
The first result in this direction shows that closed subgroups of T{p) are 'perfect' in the sense 
of Hausdorff dimension. Recall that G' denotes the derived subgroup of the group G. 

Theorem 5. Let G C T{p) be a closed subgroup. Then we have lim(7„(G') — 7„(G")) = 0. 
In particular, dimn G = dimn G' . 

As a corollary, solvable subgroups are zero- dimensional, and positive-dimensional closed 
subgroups cannot be abstractly generated by countably many solvable subgroups. 

Unlike the infinite group T{p), the finite symmetric p-groups r„(p) do have large Abelian 
subgroups of density 1 — 1/p. However, they cannot be glued to form a large subgroup of 
T{p). The following theorem explains the background of this phenomenon. 

Theorem 6. Let G C r{p) be a solvable subgroup with solvable length d. Then we have 
'Yl^=Qln{G) < Gd where G is a constant depending on p only. 

One motivation to study dimension of subgroups in T[p) comes from the theory of just 
infinite pro-p groups, which are regarded as the simple groups in the pro-p category. A 
group G is i ust in finite if every pro per quot i ent o f G is finite. By the characterization of 
Grigorchuk ( 200Cl| ) (which is based on Wilson ( 197l[ )). groups with this property fall into two 



classes: one is the so-called b ranch groups, which have a natural action on T{p) (see Section 
Hfor details). iBostonI (|2nnnh suggests a direct connection between Grigorchuk's classes and 
Hausdorff dimension; he conjectures that a just infinite pro-p group is branch if and only if 
it has an embedding into T{p) with positive-dimensional image. 

In the following we contrast known results about branch groups with new results on 
1-dimensional subgroups. We first show the following. 

Theorem 7. Let G C r{p) be a spherically transitive 1-dimensional closed subgroup. Then 
every nontrivial normal subgroup of G is 1-dimensional. 

It would be interesting to see whether a similar result holds for spherically transitive 
po sitive- dirnensio nal closed subgroups. 

Wilson ( 20001 ) conjectured that every just infinite pro-p branch group contains a non- 



abelian free pro -p subgroup. An affirma tive result is known if the group is not virtually 
torsion-free (see Grigorchuk et al. ( 2000a| )). Another beautiful result in this vein is given by 



Zelmanov ( 2000| ) on groups satisfying the Golod-Shafarevich condition. In light of Boston's 



conjecture it is natural to formulate the following. 

Conjecture 8. Let G C r(p) be a positive- dimensional closed subgroup. Then G contains a 
nonabelian free pro-p subgroup. 

This might be more attackable than Wilson's original conjecture. A step in this direction 
is the following. 
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Theorem 9. Let G C r{p) be a spherically transitive 1- dimensional closed subgroup. Then 
G contains a nonabelian free pro-p subgroup. 

Another me asure of largeness for pr ofinit e groups is whether they c o ntain de nse free 
subgro ups (see Pvber and Shalev ( 2001 ) and Soifer and Venkataramana ( 2000l )). IWilsonI 



( 2000l ) showed that just infinite pro-j9 branch groups contain dense free subgroups. Let d{G) 



denote the minimal number of topological generators for G, which may be infinite. 

Theorem 10. Let G C r(p) be a closed subgroup of dimension 1 and let k > d{G). Then 
G contains a dense free subgroup of rank k. 

In other words, the free group Fk is residually 5" where 5" denotes the set of congruence 
quotients of G. 

A common tool throughout the paper is the so-called orbit tree of a subgroup G C T{H), 
the quotient graph of T modulo the orbits of G. Orbit trees reflect the conjugacy relation 
(see lOawron et al" ( 200l[ )) and can be used to describe the structure of Abelian subgroups 



in V{p) (see Section |H)). The key observation leading to Theorem^ is the fact that the orbit 
tree of a random element is a Galton- Watson tree. A crucial step in Theorems El and 0] is 
that the orbit tree of a randomly generated subgroup has finitely many rays with probability 
1, i.e., the closure has finitely many orbits on the boundary of T fProposition l3.9j) . Theorem 
El also traces back to estimates on orbit trees. 

Another tool used in the paper is word maps between the spaces T{ v)^. A nontrivia l word 
w G Fk can be thought of as a map T{H)'^ r(if) via evaluation. iBhattachari e3 (|l995) 
proved that random substitution into a word map gives 1 with probability 0, or equivalently, 
the kernel of the map has Haar measure zero. As a result, random elements of T{H) generate 
a free subgroup with probability 1. The key result leading to Theorem ITUl is the following 
generalization. 

Theorem 11. The kernel of a word map is not full- dimensional in r{H)'^. 

Another generalization leading to Theorem 01 is that random evaluation of a word map 
yields an element of T{H) which fixes only finitely many vertices with probability 1. 

The structure of the paper is as follows. Section |21 contains the proof of Theorem ^ 
It also introduces notation and discusses conjugacy classes and random elements in T{H). 
Section |21 discusses the orbit structure of random subgroups. In Section jH we explore word 
maps and prove Theorems 01 and ^2 Sections El and El introduce the technical tools needed 
for proving Theorem El which is done in Section [71 Section El discusses small subgroups, and 
covers Theorems El and El Section El is about high-dimensional subgroups, and contains the 
proofs of Theorems HJ El El and [TUl 



2 Elements 

This section studies the statistical properties of elements of T{p). Random elements are 
described via the family tree of a Galton- Watson branching process. This allows us to 
answer Turan's question (see Theorem |T)). 
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We first describe our notation for (iterated) wreath products. Let {Hi,Xi), 1 <i <n 
be a sequence of permutation groups where n might be infinite. Let di = |Xj|. For < i < n, 
define the level i as 

dTe:=XiX ...X X,, 

and the tree T„ which has vertex set V{Tn) given by elements of the disjoint union of all 
levels. The sequence w G V{Tn) is a child of f G V(Tn) (or v is the parent of w) if w begins 
as V and has one extra element. The edge set E is the collection of {v, w} for all such v, w. 

Consider a map g that assigns to each vertex v G dTi for i < n a.n element of Hi_i. Let 
r be the set of all such maps. An element G F acts on T„ bijectively via the rule 

(xi,...,x,r:=(xf^.^«^^«,...,a;f-'---'^-^«). (2) 

and T = Hi I ... I Hn is easily checked to be a subgroup of Aut(T„). This construction also 
works in the case n = oo. 

If n is finite, then the group structure of F„ does not depend on the permutation repre- 
sentation of the last group Hn on Sym(n), only on the abstract structure of Hn. This defines 
H I K where K is an abstract group. When we talk about g & Hil H2, the wreath product 
of two groups, we will often use the shorthand g{x) for g{{x)), where x E Xi. The rule © 
yields the multiplication rule 

{gh){x) = g{x)h{x^). 

We will use the notation F„(if) for the wreath product of n copies of a permutation group 
{H,X), and the shorthand T{H) = Foo(-ff). The most important cases are the symmetric 
p-group F„(Cp) and the full automorphism group F„(Sym((i)) of T„. 

If G is a group, and S G G, then (S) denotes the group abstractly generated by S, 
that is the set of elements that can be obtained from 5* via repeated group operations. If G 
is a topological group, then we call the closure (S) the group topologically generated by 
S. 

Let G be a group acting on a tree T. 

Definition 2.1. The orbit tree of the group G is the quotient graph Tq of T modulo the 
set of orbits of G. 

It is easy to check that orbit trees are in fact trees. 

For the purposes of Proposition 12.31 the orbit trees of group elements elements need to 
contain more information about the local structure of automorphisms. Let f be a vertex of a 
tree T = and let k denote the length of the orbit of v under an automorphism g. Then 
g^ acts on the child edges of v, which are labeled naturally by elements of X. Let ig{e) C X 
denote the orbit of the child edge e. For a set (e.g. orbit) S of edges at the same level, let 
ig{S) = ig{e) for the lexicographically smallest e E S. 

Definition 2.2. The orbit tree of a tree automorphism g is the quotient graph Tg of 
T modulo the set of orbits of g, together with the labeling ig of the edges of Tg. 

The orbit tree of g E F(/7) is an if-labeled tree, defined as follows. 
An if-labeled tree is a rooted tree together with an edge-labeling £ so that for v E V, 
i is a bijection between the child edges of v and the cycles of some hy G H. 
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Two if-labeled trees are called equivalent if (1) there exist a graph isomorphism v v\ 
e ^— e' between them, and (2) for each f , there exists an & H so that i{e') = C.{e)^^ for 
all child edges e' of v' . 



The following extension of a theorem of ISushchanskvl ()l984 ) is not difficult to check, and 
motivates the notion of orbit trees. 

Proposition 2.3. Two elements ofT{H) are conjugates if and only if their orbit trees are 
equivalent. 

In most important cases, equivalence is only a little more than graph isomorphism. Say 
two labellings of a tree are equivalent if they define equivalent if-labeled trees. In the case 
H = Sym((i), two labellings are equivalent iff the length of the edge label cycles agree. In 
this case, it suffices to label edges by the cycle lengths. If H = C2, then any two labellings 
are equivalent. U H = Cp for p prime, then each vertex has either 1 or p children, and 
labeling is the same as cyclicly ordering the children whenever there are p of them. 

We now turn to the description of the orbit trees of random elements. Our convention is 
that unless otherwise specified, the term random elements denotes independent random 
elements chosen according to uniform (in the infinite case Haar) measure. 

Definition 2.4. Labeled Galton- Watson (GW) trees. Let L be the set of finite se- 
quences with elements from a set of labels L, and let z/ be a probability distribution on L. 
We define the probability distribution GW(z/) on infinite trees with edges labeled by L by 
the following inductive construction. In the first generation, we have 1 individual. Given the 
individuals at level n, each of them has a sequence of child edges picked from the distribution 
u independently. The other endpoints of the child edges form generation n + 1. 

Let (if, X) be a permutation group, and let uh be the distribution of the sequence of 
orbits of a uniform random element. 

Proposition 2.5 (Random elements and GW trees). If g E r(if) is a Haar random 
element, then the distribution of the orbit tree Tg is G\N{uh)- 

Proof. We prove this by induction. The inductive hypothesis is that the first n levels 
of Tg have the same distribution as the first n levels of a GWIuh) tree. 

Indeed, suppose this is true for n. Let w = {v, . . . , v^^ ^) G be an orbit of an element 
on level n of T. Then the child edges of w in Tg correspond to, and are labeled by orbits 
of g^{v). But g^{v) = g{v)g{v^) . . . g{v^ ^), and the factors on the right hand side are 
chosen independently uniformly from if, which implies that their product also has uniform 
distribution. □ 

The simplest consequence of this is the following asymptotic law. Let r{H) denote the 
permutation rank of if, i.e. the number of orbits of the action of H on pairs of elements of 
X (e.g. r(ii') = 2 if if is 2-transitive on X). 

Corollary 2.6. Let the group H be transitive on X, and let gn € r„(if ) be random. Then 
as n ^ oc, 

nP{gn fixes a vertex on level n) — > 2/ {r{H) — 1). 
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Proof. Let N be the number of fixed points of a random element of H on X; then it is 
well known that EA^ = 1, EA^^ = r, so Var N = r — 1. 

A vertex at level n of Tg^ corresponds to a fixed point if and only if all of its ancestor edges 
are labeled by a fixed point. This happens if it lies in the subtree of Tg^ gained by removing 
all descendant subtrees which have an ancestor edge not labeled by a fixed point. Such a 
subtree has Galton- Watson distribution with branching structure given by the number of 
fixed points iV of a random eleme nt in H. Since EiV = 1 , this is a critical tree, so by a 
theorem in branching processes (see Athreva and Nev ( 1972| )) the probability that it survives 
until generation n is asymptotic to 2/(nVariV), proving the corollary. □ 

The following corollary is immediate. 

Corollary 2.7. Let H be transitive on X , and let g G ^{H) be chosen according to Haar 
measure. Then g fixes only finitely many vertices ofT with probability 1. 

Let for an integer n, let p{n) denote the highest power of p dividing n. Let h & H he 
a uniform random element, and let p be a prime. Let fip{k) = fiH,p{k) denote the expected 
number of orbits v of h with |) = k. Let fip be the generating function for fip-. 



k=0 

and note that the sum has only finitely many non-zero terms. 
Proposition 2.8. Let gn G r„ 6e a random element, let 

. log/ip(e^) 

a„ = mm ^ . (6) 

x>o X 

Then EpdgnD/n — > ap and there exist ci, C2 > so that for all n, k we have 

P[|p(k|)-Ep(K|)|>A;]<Cle-'=^^ 
in particular, Vai{p{\gn\)) remains bounded. 

The bounded variance statement is somewhat surprising, as in most limit theorems vari- 
ance increases with n (in most cases it is on the order of n). The asymptotics of the order 
of a typical element is immediate from this result. 

Corollary 2.9. As n ^ oo, we have 

\og\gn\/n J^Op logp 

in probability, where the sum ranges over primes p dividing \H\. 

As a special case, we get an answer to Turan's question, which we restate here. Let p^" 
denote the order of a random element of the symmetric p -group r„(p) and let ap be the 
solution of the equation 

a(l-a)i/"-i = (4) 

in (0,1). 
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Theorem We have Kn/n in probability. 

Turan's goal was to find a natural analogue of the theorem of the Erdos and Turan ( 1965[ ) 



about the asymptotics of the order of a random element in Sy mfn). Their pape r start ed the 
area of statistical group theory. Theorem[T]was conjectured by Palfv and Szalay (Il983l ) : they 



proved the upper bound and a linear lower bound with a different constant. Puchtal ( 2001 ) 



shows that th e limit exists. He also studies uniformly randomly chosen conjuQ acy cl asses in 
Puchta ( 2003| ). In a related recent paper motivated by random matrix theory, Evans (2002.) 



studies the random measure given by the eigenvalues of the natural representation of r„(p). 

Proof of Theorem ^ H is the cychc group of order p, so fip{z) = 1 + z{p — 1) /p. Let 
/(A) = log fip^e^). Then ^ equals the minimum of /(A)/A, and setting the derivative to 
we get 

/(A)/A = /'(A). (5) 

At equality, this expression gives a. The right hand side is the logarithmic derivative of 
fip{e^), so it equals a = (e'^(]9 — l)/p)/(l + e^{p — l)/p)- Using this, we easily express A and 
then /(A) from a. Substitution into (0) and algebraic manipulations give EKn/n ap, and 
convergence in probability follows since the variance of Kn is bounded. □ 

For the proof of Proposition El we need to understand the order of an element in terms of 
its orbit tree. A simple inductive argument shows that if f G is an orbit, then its length 
is given by the product of the lengths of labels on the simple path 7r((),f) from the root () 
to V in Tg-. 

\v\= n 1^(^)1' (6) 

and therefore 

eg7r((),D) 

Proposition 12.111 below shows that all properties about the length of orbits of r„ can be 
understood in terms of a branching random walk. 

Let {pi}i<i<d be a sequence of primes. Consider an orbit o of an action group if on a 
set X, and let ex(o, X) denote the point in Z'^ whose ith coordinate is the exponent of the 
highest power of pi dividing \o\. Let ex{H,X) denote the multiset {ex(o) | o orbit of H}. 

Definition 2.10. Let z/ be a probability distribution on the space S of finite multi-sets of 
elements of Z"^. A branching random walk is a probability measure on infinite sequences 
n > 0} with elements from S. It is constructed recursively as follows. At time n = 
we have Hq = {0}, the set containing the origin. 

If En is already constructed, then each x E En has "offspring" x + Yi, . . . ,x + Yn. Here 
(Yi, . . . , Ya?) is picked from u independently from the past and from the offspring of other 
individuals at time n. En+i is the multi-set union of all the offspring of a; G S„. 

We call the measure 

Mx) = El > 'i(y, = x) 
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the occupation measure of the BRW. Let jl denote the generating function for the measure 
/i. 

In our case, the relative offspring positions are given by the sequence 

{ (pi(|o|), . . . ,prf(|o|)) : o orbit of/i}, 
where /i is a uniform random element of H. 

Proposition 2.11 (Orbit lengths and BRW). Let g G T{H) he a Haar random element, 
and let S„ = ex{{g),dTn). Then n > 0} is a branching random walk on Z"^. 

Proof. This is immediate from the formula (jZj) and the proof of Proposition 12. 51 □ 

Since p{\g\) equals the maximum of p{\o\) for the orbits of g, we need to understand the 
position of the highest element of H„ . The correspo n ding que s tions for the branching random 
walk have been answered by tHamm erslevI ()l974 ) , Biggind ( 1977 ) , and Dekking and Host 
()l99l[ ). The part of their results that is relevant to our setting is stated in the following 
theorem. 

Theorem 2.12. Let X„ denote the position of the greatest individual of a 1-dimensional a 
BRW whose occupation measure fi has finite support. Let 

. r log/i(e^) 
a = ml 



A>0 A 
Then 

EXn/n a, 
and there exist ci,C2 > so that for all n, 

P[|X„-EX„| > fc] <cie-^2". 

Here we present a sketch of the proof of the first statement in a simple case. 

Proof. Let denote the expected number of individuals at position k at time n. 

If we find the highest k for which this quantity is about 1, that means that on average there 
will be 1 individual at position k, and less than one individuals at positions higher than k. 
Intuitively, this k seems to be a good guess for the position of the highest individual. There 
are technical arguments to make this intuition rigorous. 

The next step is to solve fi*"'{k) = k. By considering one step in the branching random 
walk one discovers the convolution formula 

k 

fi*-{k) = Y,fi*^^~\j)fi{k-j) 

j=0 

but convolution turns into product for generating functions: /i*" = /t". For example, for the 
symmetric p-group 

fiizr = ii+zip-i)/pr, 
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and we are left with finding the power k of z for which the coefficient in the above expression 
is about 1. This gives the approximate equation 




( 



) 



k 



P 



a = k/n 
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p — 1 



and by the Stirhng formula the asymptotic version of this equation is (j3)). For the general 
case, the theory of large deviations is a standard tool for approximating fi*"'{an) for large 



3 Orbit trees of random subgroups 

Just as for single elements, there is a natural graph structure on the orbits of a subgroup 
G C T{H), see Definition I2.1I In this section, we prove that the orbit tree of random 
subgroup (i.e. subgroups generated by random elements) is a multi-type Galton- Watson 
tree. As a result, we show that the closure of a random subgroup has finitely many orbits 
on dT with probability 1, and compute the probability that it is transitive. 

Definition 3.1. Multi-type GW trees. Let T be a finite or countable set called types. 
Let T be a set of finite sequences of T. For each y G T let probability measure on 

T. A multi-type Galton- Watson tree GW(/i.) is a random tree constructed as follows. The 
first generation consists of an individual is of a given type y E T. Individuals in generation 
n have children according to the measure fiy, where y is the type of the individual. The 
children of a generation are picked independently and they form the next generation. 

Let j > I, k > 0, and {H,X) a permutation group. Consider the action of the subgroup 
generated by (j — l)/c + 1 random elements of H on X. Consider the list of the length of the 
orbits, and replace each element i of this list by £k. The distribution of the new list yields 
a probability measure f^H,j,k on finite sequences of integers. 

Proposition 3.2. Let G be the subgroup generated by j random elements ofr{H). Then 
Tg has multi-type Galton-Watson distribution G\N{fiH,j, ) , where the types correspond to the 
length of the orbits. 

First we introduce some notation. Let G = H I K a, permutation group, let Gx denote 
the stabilizer of x, Gx+ the subgroup of elements g G G^ for which g{x) = id. The group 
Gx/Gx+ "ignores" the action everywhere except at x] it is naturally isomorphic to K. 

The following observation is immediate. 

Fact 3.3. Let G C T{H), and let Tq^v be the descendant subtree of the Tq at the orbit w of 
f G T. Then Tq^v is isomorphic to Tg^/g^+ with the labels multiplied by the length of w. 

Lemma 3.4. Let {H,X) be a permutation group and let K be group. Consider the subgroup 
G generated by j Haar random elements of HlK. Conditioned on the orbit V of x, Gx/Gx+ 
has the same distribution as a subgroup generated by \V\{j — 1) + 1 random elements of G. 



n. 



□ 
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Proof. Let gi denote the randomly chosen generators of H. Consider the directed 
Schreier multi-graph (V, E) of the action of G on V. For each element v & V this graph has 
out-edges {v,i) from v to Let tti denote its fundamental group of the graph. It consists 
of equivalence classes of cycles starting from x. The cycles may contain an edge in either 
direction. The equivalence relation is generated by adding or removing immediately retraced 
steps. 

It is well-known that for a connected graph, tti is a free group generated by — |\^| + 1 
elements. Here is one way to specify a set of generators. Take a spanning tree of {V,E). 
For each edge e in the set Eq of edges outside the spanning tree, the path We is follows 
the path in the tree from a; to e_, moves along e and then back in the tree to x. Clearly, 
l^ol = 1^1 - \V\ + 1. 

Each path from x is described by a word in the generators Qi and their inverses. If two 
paths are equivalent, then the corresponding words w have evaluate at x to the same element 
of K. Moreover, a word is in Gx if and only if it corresponds to a cycle. 

It follows that Gx is generated by the words corresponding to generators of tti. It suffices 
to show that for the generators discussed above, {we/Gx+}eeEo are independent random 
elements of Gx/Gx+- 

For an edge e = {v,i) G E, let g{e) = gi{v). Respectively, let g{e) = giiy)^^ for the 
reversed edge. If w = ei . . . e/j is a cycle starting from x, then 

w/Gx+ = giei) . ..g{ek). 

Now fix the values of g{e) corresponding to e G E\Eo. Then for e G Eq, We/Gx+ equals g{e) 
multiphed on the left and on the right by fixed elements. Since the {g{e)}eeEQ are uniform 
random and independent, so are the {welGx+}eeEo, as required. □ 

Corollary 3.5. For v & T of orbit length k the subgroup G^/G^j^ has the same distribution 
as G(j-i)k+i- 

The following fact is also immediate. 

Fact 3.6. Let Vi, . . .Vk be vertices of T. Let A denote the event that none of the vertices 
w{vi) of Th equals another or an ancestor of another. Conditioned on A, the subgroups 
GyJGvi+ are independent. 

Proof of Proposition \'A.2l Proposition ^21 easily follows from Corollary Fact 
lOl and Fact EH □ 

Let q{j) denote the probability that j random elements generate a transitive subgroup 
G of H. Looking at the probability that Tq does not branch we get the following 

Corollary 3.7. 

n-l 

P{G is transitive on level n) = JJ^ 5' (l + (j — • (9) 

e=o 
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A subgroup of T{H) is called spherically transitive if it acts transitively on every 
level of the tree. As an example of Corollary 13 .71 consider two random elements of the 
automorphism group of the binary tree. In this case j = 2, H = C2, we have q{n) = 1 — 2~", 
and 

00 

P(G is spherically transitive) = - 2"^'"^) ~ 0.63. 

£=0 

In the general case, we have 

Corollary 3.8. Let j > 1, and assume that H has a transitive subgroup generated by j 
elements. Then with positive probability, j independent Haar random elements of T{H) 
generate a spherically transitive subgroup. Moreover, this probability tends to 1 as j ^ 00. 

Proof. Clearly, l — qinj) < (1 — g^)". This means that 1 — + — 1)|X|^) is summable, 
so ioT n = 00 the product is positive. □ 

Proposition 3.9. Let G be the subgroup generated by two independent Haar random ele- 
ments ofT{H). Then with probability 1, Tq has only finitely many rays. Equivalently, the 
number of orbits at level I remains bounded as £ 00. 

Proof. Consider the vertices of Tq that are orbits of length 1 in T. These induce a 
subtree with a subcritical GW distribution (the number of fixed points of just one random 
element has expectation 1 by Polya theory). 

This implies that with probability 1 (1) there exists finitely many fixed vertices of T, (2) 
there exists finitely many vertices of T with orbits of length at most k for each k. 

On the basis of Corollarv l3.8l let k be so large that the chance that (|X| — 1)A; + 1 random 
elements generate a transitive subgroup of r(i7) is at least 1 — 1/|X|. 

Now consider the tree built to a level where each orbit has at least k elements. Then 
each such orbit will stop splitting forever with probability at least 1 — 1/|X|, and if it does 
not, then it splits into at most |X| offspring with probability at most 1/|X|. Thus the orbit 
process is dominated by the subcritical GW tree with or |X| offspring with probability 
at least 1 — 1/|X| and at most 1/|^|, respectively. Hence all orbits stop splitting with 
probability 1. □ 



4 Word maps 



Let G be a group with finite Haar measure. A randomly evaluated word in a group G 
is a G-valued random variable defined by a word w in the free group by substituting k 
independent Haar random elements of G into the generators of F^. 

Ra ndomly evaluated words are perhaps the simplest possible maps G^ — > G. Bhattacharied 
showed that with probability 1, a nontrivial word randomly evaluated in r(if) does 
not give the identity; moreover, it does not stabilize a fixed ray in the tree. In this section we 
strengthen this result in two directions, and give a sufficient condition for a list of randomly 
evaluated words to have independent uniform distri bution . 



Our first result is key in answering a question of ISidkil (|200l[ ) 
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Proposition 4.1. A nontrivial randomly evaluated word in T{H) fixes only finitely many 
vertices ofT with probability 1. 

Proof. Let w = wi . . . we, where each Wi is one of the random generators or its inverse, 
and w is in reduced form. We use induction on the length i of w. The i = 1 case is exactly 
Corollary O 

Let W denote the random evaluation of w. Let f G T be a vertex. Let w'^ = wi . . .Wi, 
Vi = v'^i. Make the assumption A that v"^ = vi = v, but Vi ^ Vj for < i < j < i. We use 
the notation g{v) for the action of g E r(if) on the descendant subtree of v. Then 

e 

1=1 

Given the event A, the factors in the product are independent Haar random elements of 
r(if), and therefore w{v) is also a Haar random element of T{H). This means that with 
probability one, by Corollarv 12.71 only finitely many of the descendants of v are fixed by w. 

Now we are ready to prove the proposition. By the inductive hypothesis, there are only 
finitely many vertices v which are fixed by the evaluation of any of the proper sub-words 
Wi . . .Wj, i < j of w. Let L be the greatest level at which there is such a vertex. Then at 
level L + 1 event A holds for all vertices v that are fixed by w. In particular, all vertices at 
this level have finitely many offspring that are fixed by w, as required. □ 

This proof generalizes to the infinite wreath product of arbitrary (not necessarily identi- 
cal) finite transitive permutation groups. From probabilistic point of view, this property is 
interesting because it describes a critical phenomenon in the statistical physics sense. One 
manifestation of this is that the probability that a word is satisfied on levels 1 to n decays 
polynomially in n, as opposed to exponentially (see Corollarv 12. 6|) . 

The following corollary is immediate. 

Corollary 4.2. Let G be a subgroup abstractly generated by k random elements ofT{H). 
Then G is a free group acting freely on dT . 

In fact, we have that all elements of the countable group G have only finitely many fixed 
vertices in T. We call a countable free subgroup G of T{H) with this property strongly 
free. We are ready to prove Theorem ^ which we restate here. 

Theorem IH Let G C T{H) be a strongly free subgroup and let g G r(iJ) be a random 
element. Then the group {G,g) is strongly free with probability 1. 

Proof. The proof of Proposition 14.11 did not use the full power of randomness of all 
generators; it was sufficient to have one random generator. 

In fact, we have the following corollary to the proof of Proposition 14.11 Let gi be random 
and (72, ■ ■ - Qk fixed elements of T{H). Let if be a reduced word in the gi, containing gi or 
its inverse. If the evaluation all proper sub-words of w fixes finitely many vertices of T with 
probability 1, then the same is true for w. 

Theorem 0] follows . □ 
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The adding machine s is an element of T{p) that acts as a full cycle on T{p), or 
equivalently, one that has orbit tree given by a single ray. More precisely, it is a specific 
such element, defined by the following identity. For a vertex v = {vi, . . . ,Vn) at level n let 
N{v) = f 1 + V2P + . . . + VnP^~^- Then for every n and vertex v at level n we have 

N{v') = N{v) + 1 (modp"). 

The adding machine s is one of the simplest examples of elements of T{p) that are given by 
finite automata . Groups generated by suc h elements in clude Grigorchuk's group, and have a 
rich theory, see iGrigorchuk et al. (|2nnnbh . Isidkil (|2nnn>) showed that s and another element 



given by a finite automaton cannot generate a free subgroup of T{p). He asked ( Sidki ( 200l[ )) 
whether the adding machine and any other element can generate a free group. Since the 
adding machine generates a strongly free cyclic subgroup, we have 

Corollary 4.3 (Answer to a question of Sidki). The adding machine and a random 
element abstractly generate a free subgroup ofr{p) with probability 1. 

Simple maps M'^ ^ R tend to have the property that the pre-images of points are at most 
k — 1 dimensional. A word in k letters evaluated in T{H) is a simple map w : r{H)^ r(iJ). 
It is natural to ask whether the analogous statement is true here. The answer is no, for k = 1 
a counterexample is the map x ^ in T{H). If g E r(if) acts as a full cycle [vi, . . . ,Vpt) on 
level i of the tree, and the action g{vi) on the descendant subtrees satisfy g{vi) . . . g{vpi) = 1, 
then g is in the kernel of this map. The set of such elements has dimension 1 — p~^. 

For general k, we first need to define dimension. Consider the usual Li-metric on the 
product space r{H)'^, which is built from the metric (jl8j) on T{H). This automatically gives 
rise to a notion of dimension in the usual way. If 5 C r(if)^, then it is easy to check that 
upper Minkowski dimension, which dominates Hausdorff dimension, satisfies 

dim^^ = hmsup (log \S/iT^''^f\/\og |r„|) . (10) 

n— >oo 

The 1-dimensional example above easily generalizes to show that the kernel of a fc-letter 
word map can have Hausdorff dimension arbitrarily close to k. However, the kernel of a map 
cannot be full- dimensional, as the following restatement of Theorem ^2 shows. 

Theorem 4.4. Let w be a nontrivial word in the letters gi, . . . ,gk, and let 

lC = {{g,,...,g,)eT{Hf ■.w = l}. 
Then dimH(/C) < dmijj{lC) < k. 

Proof. The first inequality holds in every metric space. We first make the assumption 
(i) that there exists elements in H for which w ^ 1. Let ^ denote the length of the word 
(i.e., the sum of the absolute values of the exponents), and let h = \X\. Fix the action 

= (s'j/r*-""^-'; 1 < i < fc) of the variables on T„_i, let s{An) denote the number of liftings 
of these elements to r„ for which w = 1. Let h = \H\, and let t„ = h^^" denote the total 
number of liftings. Let s„ = maxA„ s{An). Then we have 

|/c/(r("))'=| <si...s„. 
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and therefore by (fTUj) 



dinij^j K, < k lim sup ^ ^ < lim sup ^ 



log()f:i . . . t„) login' 

The last inequality is simple arithmetic. 

Since the action on T„_i is fixed, for v G dTn-i we have that w{v) is a word in the 
letters gi{u), u e dTn-i- More precisely, we get w{v) by replacing each occurrence of Qi 
in by some gi{u), where the u depend on the action An and need not be the same for 
different occurrences of Qi. Together with our assumption (i), this implies that there is a 
counterexample to w{v) in H. 

We claim that there exists a large subset S C dTn-i so that if m, f G S", then the 
words w{y) have no letter in common. Indeed, each word uses at most £ letters, and 

conversely, each letter appears in at most I words (more precisely, the number of words using 
gi{u) is at most the total number of times is used in w). This means that the graph in 
which u, V are neighbors iff w{u), w{v) share a letter has maximal degree at most d = 
(ignoring loops). Such a graph has an independent set S of size at least \\dTn-i\/ {d + 1)] 
(using the greedy algorithm). 

Let f G S* and consider the set of letters L that contribute to w{v). The number of 
assignments of values in H to the letters L is /i'^'; the number of assignments for which 
w{v) = 1 is at most h^^^ — 1. Since these set of letters are disjoint for different v, we have 
that 

siAn)<Uih'-l)/h'f\ 

and therefore with e = log{h^/{h^ — 1)), we get 

logs„ < logt„ - £67(rf + !)<(!- e') logt,, 

where e' > does not depend on n. This implies the claim of the proposition when assump- 
tion (i) holds. Otherwise, for some finite j the group H' = ^i ( H) ha s a counterexample, since 
T(H) contains a free subgroup of rank k (see iBhattacharied ()l995| ) and Corollarv 14.21 here) . 
Since the natural isomorphism between r{H)^ and r{H')'' preserves full- dimensionality, the 
proposition applied to H' concludes the proof. □ 

Remark 4.5. We have shown that the pre- image of 1 G r(if) is not full-dimensional. It is 
easy to modify this proof to get that the pre-image of any point g G r(if) has dimension at 
most 1 — e'{w), where e'{w) does not depend on g. 

Corollary 4.6. Let G C T{H) be a subgroup of Haus dor ff dimension 1. Then k random 
elements generate a free subgroup with probability 1. 

Proof. It suffices to show that for each word w in k letters, the probability that w = 1 
in G is zero. This probability equals the measure of Kyj in G^ . But G^ is a full-dimensional 
subgroup of r[H)'^, so every subset of G^ with positive measure has full dimension in r{H)^. 

□ 
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Let G be a finite p-group. Our study of random generation requires a sufficient condition 
for a set of randomly evaluated words to have independent uniform distribution on G. For 
a word w in the variables Qi define the exponent sum vector of w as the vector whose 
coordinate i is the sum of the exponents of gi'mw. Let w denote the evaluation of the word 
w. We have not been able to locate the proof of the following simple lemma in the literature. 

Lemma 4.7 (Linear and probabilistic independence). Let Wi, . . . ,Wk be words in the 
letters gi, . . . , gn+m, so that the exponent sum vectors restricted to the first n coordinates are 
linearly independent mod p. Fix gn+i, ■ ■ ■ gn+m ^ G- Then the substitution map G" — > G^, 

(51, ■■■,9n) ^ (wi, ...,Wk) 

covers evenly, i.e. it is \G\^~^ -to-one and onto. 

A special case of Lemma f4. 71 savs that if w is a word in which the exponent sum of some 
letter is relatively prime to the order of a finite p-group G, then the evaluation map covers G 
evenly. We state without proof that this also holds for nilpotent groups but not for arbitrary 
finite groups. 

Proof. Assume that the gi,...,gn are chosen independent uniformly and at random 
(i.u.). It suffices to show that the evaluations Wi{g) are i.u. We prove this by induction; it 
is clearly true for G = 1. Let Z < G be a subgroup of order p of the center of G. Let R be 
a set of coset representatives of Z, and let r : G — >^ i? be a coset representative map. 

Write gi = Zir{gi). Then Zi & Z, ri & R are i.u. for i < n. Because the Zi are in the 
center, we have 

Wi{g) = Wi{z)wi{r{g)) = Wi{z)hir{wi{g)) 

Where hi & Z and the random vector h depends on r{g), but not on z. The assumption 
implies that Wi{z) are i.u., and therefore given 6, Wi{z)bi are i.u. Thus TUi are products of i.u. 
elements of Z and i.u. coset representatives, and these 2n random variables are also jointly 
independent. The claim follows. □ 

By considering finite quotients, we see that the assumption of Lemma 14.71 implies that 
for any pio-p group G the substitution map is a measure-preserving surjection. 

5 Schreier graphs and homology 

The goal of this section is to establish some tools for proving Theorem El about random 
generation. 

Let i^' be a p-group acting on a set X, and let S = {gi, . . . , gi} be a multi-set of elements 
of K. The Schreier graph Q = Q{X,S) is a directed multi-graph whose vertex set is X, 
and the edge set is X x 5*, where the edge {v,g) connects v to v^. 

Let tQ denote the vector space of 1-chains, that is the set of abstract linear combinations 
of the edges of Q with coefficients in ¥p. 

If w is a word in S, i.e. an element of the free group indexed by S, then for each f G X, 
w{v) can be thought of as a path {v, f"""), where the wi, . . . , w„ are the initial sub-words 
of w and w denotes evaluation in K. 
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For a path vr, let ttt G tQ denote the 1-chain of tt where each edge appears with a 
coefficient given by the number of times it is used in vr. 

Recall that the exponent sum vector of w is the vector whose coordinate i is the sum 
of the exponents of Qi in w. 

Lemma 5.1 (Independent 1-chains). Let W be a set of words in whose exponent sum 
vectors are linearly independent. Then {rw{v) : w &W,v & X} are linearly independent. 

More generally, let W be a set of words in S whose exponent sum vectors restricted to 
5" C 5* are linearly independent. Then {tw{v) : w eW,v E X} restricted to S' -edges are 
linearly independent. 

Proof. We prove the more general version. Let S = {gi, . . . , g£}, let n < i, and let 
5" = {gi, . . . ,gn}- Without loss of generality we may extend the set W so that \W\ = n. 
Consider the wreath product D = Cpl K, and consider the evaluation map 

(di,...,dn) 

where for i > n we take the ith letter to be constant di := {0,gi). This map is a bijection 
because of the linear independence assumption and Lemma 14.71 Now restrict the map- 
ping to all possible extensions of {gi, . . . ,gn) G K"' given by the fixed generators so that 
{hi, . . . ,he) I— > {h[, . . . , h'^) where hi = (z/j, gi), h[ = (z/^', g'^). Then the g'i are determined by 
the gi, and the map / : z/ i-^ z/' is an isomorphism of vector spaces fI^I" ^ Fi^^l". If the 
reduced form w E W is ai ■ ■ ■ a^., then for f G X the evaluation in D satisfies 

w{v) = ai(t;)a2K0 " ■ ■aUv'''-^-'). (11) 

Here (v) means coordinate in the wreath product. Since the edges of the subgraph Q(X, S') 
are of the form {{v,gi) : v G X, 1 < i < n}, which agree with the coordinates of u, we may 
think of z/ as a 1-chain for Q{X, S'). Then (fTT|) implies that for w G W, f G X we have 

f{u){w, v) = w{v) = t'w{v) ■ z/, 

where t'w{v) is the restriction of tw{v) to S"-edges, and ■ is the natural scalar product of 
1-chains in Q{X, S'). Since / is bijective, the statement of the lemma follows. □ 

The subspace of tQ generated by ttt for cycles vr is called the (first) homology group 
TiQ of Q. For a word w and f G X let w{ov) denote the path {w^){v), where i is the smallest 
positive power so that fixes v. Note that the path w{ov) is a cycle, and so tw{ov) G HQ. 

The following lemma is important for our study of random generation. We will need to 
show that the homology of certain cycles in a Schreier graph of a p-group K with 3 variables 
S = {91,92,93} is rich enough. We first need to make some definitions. 

Definition 5.2. For f G X, let k,{v) denote the smallest positive power of p so that there 
exists a word w in {gi,g2,9* = 9^} so that = v and the exponent sum of g^, in w is not 
divisible by p. Let vx denote a vertex for which k is minimal, let wx denote the corresponding 
word, and let k(X) = k{vx)- 
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Assume that 

9* = 93^^^ fixes no point in X. (12) 

For a set of words W, and an element / of the free group J-'w indexed by W, let Wf denote 
the concatenation of words w & W according to /. 

Lemma 5.3 (Words with special homology). Assume il^) . There exists a set W of 
|^j9i,92> I _j_ ^ ^Qj^dg j^fi g fixing vx, so that for each fixed f G !Fw fixed u ^ X the 1-chains 

T{wxWf){ou), {tw{vx) '■ weWU{wx}} 
are linearly independent. 

Proof. Let W be the generating set for the fundamental group of the Schreier graph 
Gi2 = G{K,v^^^'^^\ {gi, g2}) defined in the proof of Lemma EiH i.e. pick a spanning tree 
rooted at vx, and for each edge {vw,gi^) outside the tree consider the path w{vx) that 
moves from vx to in the tree, then along the edge, and finally back to vx in the tree. 

We need to check that if 

auT{wxWf){ou) + axT{wxivx)) + ^ a^T{w{vx)) = (modp) 

then all the coefficients equal (modp). Indeed, by Lemma FS .41 below, the first two terms 
are linearly independent when restricted to g^ edges, and the terms in the last sum vanish 
on g^-edges, thus we have a+ = a„ = 0. It is standard that rW are independent: they form 
a basis for the homology group HQu- Specifically, the edge {vw,gi^) defined in the previous 
paragraph appears in exactly one of r(t>x,u'), so = 0. □ 

We used the following lemma in the proof above. Let t^tt denote the 1-chain of vr restricted 
to (73-edges of Q. For a {(71, (yf2, (7*}-path vr, let r^vr denote the 1-chain in Q{X, {gi, g2, g<t}) 
restricted to g^-edges. 

If w is word in which the exponent sum of g^ is not divisible by p, then it follows from 
Lemma l5.ll that {t^wIu) : u G X} are linearly independent. The ideal situation is when 
there is such a word satisfying v'^ = v. But it can happen that there is no such word, so we 
need the following lemma. 

Lemma 5.4 (Independence for added noise). Assume M^) . and let u G X\{vx}- The 
vectors T^^wxivx)), T3(wx(ow)) are linearly independent. 

Proof. Let k = k{X), v = vx, w = wx, and let g* = g^. Consider the partition of X 
into {(71, (y'2, 5'*}-orbits Q. For x G X, let Qx ^ Q denote the orbit of x. 

Let Q E Q. We claim that the sets Q, Q^^, . . . , Q^s are disjoint. Suppose the contrary, 
then for x,z E Q, we have x^s = with i < j. Then x = z^s . Let w^z be the word 
in {gi,g2,9*} so that a;"'"^^ = z. Then the evaluation of the word Wxz9i~^ fixes z and its 
^^a-exponent sum is not divisible by k. But by definition k is minimal at fx, a contradiction. 

Consider the homomorphism (f of 1-chains defined by 

<P : rgiQ,{g,}) rgiX,{gs}) 
ng^{x) ^ r^g^ix) 
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which maps the the 1-chain of a 5f^,-edge to the 1-chain of its expansion, a path of length k. 
By the previous paragraph, for different x,z & Q the paths g*{x), g^{z) in Q{X, {gz}) have 
disjoint edges. It follows that ip is injective. 

Since the exponent sum of g^ in w is not divisible by p, we can apply Lemma 15.11 to 
the Schreier graph Q{Q, {gi, g2, g*}) and 5" = {(?*}. We get that {r*(w(a;)) : x G Q} are 
linearly independent. Since is an injective homomorphism, {t^{w{x)) : x G Q} are linearly 
independent. 

Let X G X and let q denote the length of the cycle of x for the action of w. Then 

q-l 

T,{w{ox)) = Y,^,{w{z'''))^Q (13) 

since the terms are linearly independent. This implies that the vectors t^{w{v)), T3{w{ou)) 
are nonzero. It also implies that if u E Qv, then the claim of the lemma holds. 

Now assume that u ^ Q^. Recall the definition of the boundary operator d. It is a linear 
map from the 1-chains of a graph Q to 0-chains i.e. abstract linear combinations of vertices 
of Q. It is defined by a relation {x^z) z — x for each edge (x, z). 

For X G X, T^w{ox) is a linear combination of 1-chains of paths with length k that start 
and end Q^- Thus dr^wi^ox) is supported on Q^. In particular, dr^wiv), and dr^w^ou) are 
supported on Q^, respectively. It therefore suffices to show that dT3,w{v) ^ 0. 

We first claim that dT^w{v) ^ 0, in other words t^w[v) is not a linear combination of 
1-chains of (7*-cycles. Indeed, all such cycles are of p-power length, and by assumption (fT^ . 
there is no cycle of length 1. But t^w{v) cannot be a linear combination of longer cycles, 
since the exponent sum of g^ in w is not divisible by p. 

Since (/9 is injective, it follows that dTj,w{v) ^ 0, completing the proof. □ 



6 Slices of random subgroups 

The goal of this section is to show that slices of random subgroups of T{p) are large; in the 
next section we will show that these large slices generate a large subgroup. 

The setup is as follows. Let be a finite p-group acting on a set X. 

Let H = r„(p). The group H I K acts on the disjoint union of |X| copies of T„. For a 
subgroup C (H I K), let aG* C denote the boundary slice, that is the pointwise 
stabilizer of the vertices on level n — 1 of the trees. Note that aG^, is a vector space. 

We first consider the case K = 1. The following lemma shows that it is possible to 
generate large slices by two elements, even if one is required to be a high power. 

Lemma 6.1. For < k < n there exists elements s, (? G r„(p) so that 

dim(a(s,/')) =p""i -p^ + 1. 



The following proof uses a technique introduced in lAbert and Virae but is self- 

contained. 
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Proof. Each element v G i9T„_i is represented as a sequence (ai, . . . , a„_i) of elements 
of Fp. We assign to each such element a number N{v) = oi + + . . . + anP""'^- Recall that 
there exists an element, the adding machine s G T{p), that acts as a full cycle on 9T„„i so 
that N{v') = N{v) + 1 (modp"-^). 

Each g G aTn we identify with the polynomial in F„ := Fp[x]/ (x^" — 1) given by 

this is an isomorphism of vector spaces, and by the above, the action of s on crF^ corresponds 
to multiplication by x in F„. Define the weight z/(/) of an element / G -F„ as the highest 
k so that = {x — 1)^ divides /. Then it is clear that elements of -F„ of different weight 
are linearly independent. Also — /) = i^{yf) = i^{f) + 1, unless = p^~^ — 1 (since 
yP" = x^" —1 = 0). This implies that if a s-invariant subspace M of crr„ contains an 
element of weight u, then it also contains an element of weight z/ + 1, then one of z/ + 2, up 
to weight — 1. Hence dimM > p^~^ — v. 

Consider the element (7 G r„ with the following properties. The action of g agrees with 
s on dTk- Also, for vertices v at levels greater than /c, g{v) = 0, except that for the vertex v 
at level n — 1 with N{v) = we have g{v) = 1. 

Let q = p^. One easily checks that g'^ G o-F^. Moreover, for v G 5T„_i we have that ) 
equals 1 if N{v) < q and otherwise. Thus g'^ corresponds to the polynomial 

/ = 1 + X + . . . + x^-i = y^-^ 

of weight q — 1 (the last equality can be checked via binomial expansion). It follows that 
g has weight q — 1, and therefore the s-invariant subspace containing it has elements of all 
weights in g — 1 . . .p""^ — 1. The claim of the lemma follows. □ 

Now let K = {gi, €12,93)- Define k, vx, wx as in Definition 15.21 (with tildes removed), 
and assume El Consider the subgroup G H I K generated by uniform random liftings gi 
of the generators gi. Let e denote the probability that |t)^5i.92>| _|_ i random elements do not 
generate H. 

Lemma 6.2 (Controlled randomness in wreath products). There exist a word w in 
the gi whose composition depends on the gi so that that Vx = vx, 'P{w{vx) = h) > 1 — e, 
and for n G X \ {vx} the distribution ofw{ou) is uniform on H . 

Proof of Lemma 16.21 With the notation and the results of Lemma (5.31 consider the 
group generated by {w{yx) '■ w G W}, with probability 1 — e this is the full group H. Then 
for some concatenation of elements of W we have w+Wf{vx) = h (otherwise we pick Wf 
arbitrarily) . 

The linear independence statement of Lemma l^^ translates to probabilistic independence 
by Lemma 1^171 Therefore, for each fixed /, m 7^ fx G X, given {w{vx) : w E W U {wy}}, 
w+Wf{u) has uniform distribution on H. This is also true for our random choice of / as it 
only depends on the values {w{v) : w G W+} (see Remark l6.3|) . □ 
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Remark 6.3. In the last paragraph of the proof we used the following simple fact. Let 
X, {Yi, i G /} be random variables so that for each i & I, the two variables X, Yi are 
independent and Y^ has distribution /i. Let f{x) be an /-valued deterministic function. 
Then X, Yf(^x) are also independent and ^/(x) has distribution /x. 

The rigid vertex stabilizer TZ^^G C G of a vertex v is the pointwise stabilizer of the 
vertices not descendant to v. 

Lemma 6.4 (Slices for a descendant tree of a vertex). There exists q > depending 
on p only so that if i := |t)^5i.92)| ^ pcon ^^^^ event 

dimaU^^G > (1 dimaF^ (14) 

has probability at least 1 — e^'^'^^i — e'^^*^""^^ . 

Proof. Let v = vx, let G^ G denote the stabilizer of v, let Gv+ = {h E G^ : h{v) = 
1}, and let vr denote the quotient map G^ Gv/Gv+ = H. Fix ci > and let ni = [cin\. 

Consider the two elements g, s used in Lemma 16.11 for generating a large slice. Using 
Lemma lfj.2l twice, with high probability we generate elements g' s' G Gy, so that irg' = g, 
TTS = g. If h' = g^"^ then h = nh' G crTn, and the smallest s-invariant subspace M of aTn 
containing h has 

dimM >p"-^ -p"^ + 1 (15) 

We will show that with high probability, h G TZyG, and thus so are its s-conjugates. 
Therefore the subspace M has an isomorphic pre-image C GmZ^G. The inequality (fT5|) 
implies the claim ()14|1 . 

Indeed, h{u) for m G X \ {v} is the p"^~''-th power of a uniform random element, where 
p"^ is the length of the orbit of u under h. By Theorem 12.121 if C5 < 1 is large enough, 
then there exist C2 > so that for all n and a uniformly chosen random Hq E H has 
P(^o ^ 7^ 1) ^ e"'^^". With the right choice of cq, ci, we get that 

P {h{u) ^ 1 for some ueX\ {v}) < p^e-"^'^. 

The probability that Lemma do es not give the right h is bounded above by the chance 
q that p™" + 1 random elements do not generate r„ (or, equivalently, its Frattini quotient 
Fp). It is known that q < e^^*^""^™^ where C3 is an absolute constant. Since we use Lemma 
16.21 twice, we can increase C3 to include a factor of 2. □ 

Let r denote the number of orbits of K on X, and let i denote the size of the shortest 
orbit. 

Lemma 6.5 (Slices of random subgroups are large). There exists co,ci, 02,03 > 
depending on p only so that if £ < p^"" then the event 

dimaG > {1- p-^'^'^^) dim a{H I K) (16) 

has probability at least 1 — r{e~^'^"i — e^^^^~^^). 
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Proof. For a ii'-orbit R of X, let k,{X) = k{vr) as in Definition 15. 2[ Let -Ri, . . . , Rr 
denote the i^-orbits of X, listed so that i^{Rt) is increasing in C.. Let = i?^ U . . . U i?^, and 
let denote the subgroup of D = H I K which fixes all descendants of vertices in X^, and 
let Ge = De (1 G. Then we have 

1 = Di > . . . > Dr+i = D, 

and similarly 

1 = Gi > . . . > Gr+l = G. 

Let A = cr(r„ I K) be the subgroup fixing all vertices at level n — 1, a vector space. Then 
we have 

r 

dim(aG') = J2 dim {aGe+i / aGe) . (17) 
e=i 

Note that the subgroup G/Ge C D/D^ acts naturally on Xi, moreover, the Qi/Dg are uniform 
random liftings of the gi/{K fl Di) to D / D^. 

We can apply Lemma 16.41 to this action with X = Xi, vx = ^ij^, since vr^ minimizes 
K over vxi- We get a large boundary shce of the subgroup IZ^^D/De) = 7lv{De^i/ Dg). 
Conjugation by Gg+i/Ge gives the |i?f|-fold direct product of this large slice (one copy for 
each vertex u G Re). Therefore with high probability 

dim{aGe+i/aGe) > \Ri\ {1 - p-^'^^^) dimffL^ 

since Lemma 16.41 is used r times, the probability of success is bounded below by the stated 
amount. The claim of the Lemma now follows from (jl7j) and the straightforward identity 
dima(Tn I K) = \X\ dimaF^. □ 



7 Dimension of random subgroups 



The notion of Hausdorff dimension was introduced bv lAbercrombid ( 1994 ) to the setting 
of compact groups. Let n be the greatest number so that g G T^"'\ Define the norm and 
distance 

\\g\\ = \T^\-\ distig,h):=\\gh-'\\. (18) 

This turns T{H) into a compact metric space, and Hausdorff dime r ision is defined in the usual 
way. Translated to our setting the result of iBarnea and ShalevI (|1997|) says that for closed 
subgroups G C T[H) Hausdorff dimension agrees with lower Minkowski (box) dimension. 
Considering what balls are in T{H), this is easily checked to coincide with the lim inf of the 
density sequence 7^ defined in the introduction (^. For technical purposes, we will now 
introduce 7^ in a slightly more general setting, for the wreath product of F(p) and a finite 
j9-group. 

Let Kq be a p-group acting on a set Xq. Let F^,, = T{p)l Kq acting on |Xo| copies of the 
infinite p-ary tree Tp. Let F^^ denote the subgroup that stabilizes all vertices at level £ of 
all trees. Given a subgroup G C F+, define the density sequence 



liiG) 



iog|g/r 

log|F+/F 
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The density sequence 7^ defined in (jT)) covers the case when Kq is the trivial group acting 
on a singe-element set. In general, we have 



dimH(G) = liminf 7£(G). 

Proposition 7.1. Let G be the the subgroup generated by independent uniform random 
liftings of three fixed elements of Kq to r+ = T{p) I Kq. With probability 1, the density 
sequence 'ji{G) satisfies 

^e{G) > 1 - e-^^ 

for c = c{p) > fixed and all sufficiently large i. In particular, dimH(G') = 1. 

The most important case is when Kq acts trivially on the single-element set Xq. 

Theorem 7.2. The subgroup ofT{p) generated by three random elements has 1-dimensional 
closure with probability 1. 

These random subgroups are the first known examples of finitely generated subgroups of 
Tip) with full dimensional closure. We propose 



Conjecture 7.3. Theorem \ 1. ij| holds even for two random elements. 

Another immediate corollary of Proposition 17.11 is 

Theorem |31 Let e > 0. Let Gn be the subgroup generated by three random elements of the 
symmetric p-group r„(p). Then P ( |G„| > |r„(p)|^~'^) ^1 as n ^ 00. 

This theorem is the p-group analogy of the famous result of DixonI ()l969l ) saying that two 



random elements of Sym(n) generate Sym(n) or the alternating group Alt(n) with probability 
tending to 1. In fact, for finite simple g r oups (j, the probability that two elements generate 
G tends to 1 as IGj — 00, see ShalevI ()l999l ). The methods used here are fundamentally 



different from the usual subgroup counting technique. 

Another interesting consequence of the Proposition 17.11 is the following strengthening of 
Theorem |21 

Theorem 7.4. For each d G [0, 1] there exists a topologically finitely generated free pro-p 
subgroup ofr{p) of Hausdorff dimension d. 



This result leads to the solution of a problem of Shalev ( 20001 ) . who asked whether a 



topologically finitely generated pro-p group can contain topologically finitely generated sub- 
groups of irrational Hausdorff dimension. Indeed, we may take three elements generating a 
full-dimensional subgroup Gi C r(p), and three more elements that generate a (i-dimensional 
subgroup Gd C T{p). It is straightforward to check that the dimension of Gd in {Gi,Gd) 
with respect to the filtration inherited from T[p) is d. 

Let T' be a subtree of T which has the same root as T and each vertex has or p offspring. 
Let Tt' ^ r(p) be the pointwise stabilizer of T'. The measure fi{T') of T' can be defined as 
lim |T^|/|T„|, where means the vertices at level n. The sequence is non-increasing, so the 
limit always exists. It is easy to check that 

dimuiTT') = l-fiiT'). 
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Let Vq denote the set of vertices of T' that have no children. Since n{T') can take any value 
in [0, 1) (even if we assume that Vq is infinite), dimn T^' can take any value in (0, 1]. In the 
following we show that random subgroups G of T^' are free and dimH(G) = dimn (Ft-/) with 
probability 1. This proves Theorem 17.41 for d > 0; the easy case c? = is left to the reader. 

Proposition 7.5. The subgroup G topologically generated by k > 3 random elements o/Tt' 
satisfies 

dimH(G) = dimH(rT') = 1 - /i(T') 
with probability 1. IfVo is infinite, then G is a free pro-p group with probability 1. 

Proof of Proposition I7.5L Let Vo+ denote the vertices Vq and their descendants in 
T. Clearly, for g G Fj-/ and v & V \ Vo+ we have g{v) = 0. Moreover, Haar measure on F^' 
agrees with picking g{v) uniformly, independently at random for each v G Vo+. 

Let gi denote the random elements, let e > 0, and let i be so that \diT'\/\diT\ < ji{T') + e. 
Condition on the action Kq of gi on Xq = dTi \ dT^. By the first paragraph, given this 
information, the gi are uniform random liftings. Proposition 17.11 applied to this action, with 
dimension appropriately rescaled, implies 

dimn G > \d,T \ deTy\deT\ > 1 - /i(T') - e. 

With probability 1, This is then true for all rational e, so dimH(G) > dimH(Fr/), and the 
other inequality is trivial. 

For the vertices f G Vq the action of G on the descendant subtrees of v are independent, 
generated by k independent random elements. If Vq is infinite, then each finite p-group that 
is generated by k elements appears with probability 1 as a quotient group of the action on 
one of the descendant subtrees. Thus each finite p-group generated by k random elements 
appears as a quotient group of G with probability 1, which implies that G is a free pro-p 
group with probability 1. □ 

We now proceed to prove the key proposition. The proof uses almost all theorems proved 
so far, in particular, it depends crucially on the results of Sections El and El 

Proof of Proposition 17. 1L For £ > 1 let m = m{i) ■= [(log£)^J, let n ■= i-m, and 
condition on the values of g^ := 5'j/F'j!"'* G F^ I Kq. Given this information, the gi/T^^^ are 
uniform random liftings of g[ to F„ I (F^ I Kq) = T^l Kq. Let denote the set of vertices 
at level m in the trees. Let b > 1, and assume that the following event Afy^m holds. Let 
Gi2 = {91,92)- For the action of G on X^'- 

(i) the number of orbits of {Gi2,Xm) is at most b, 

(ii) the shortest orbit of {Gi2,Xm) has length at least p"Y6, 

(iii) if > b, then g^^^"^^ has no fixed points. 

We now apply Lemma f6. 51 with X = Xm, K = Tm I Kq here. Let Bi be the event that 

s, := dim ((G n fJ-^)) / F?) > p'-\l - p-^'^'), 

call this event B^. Lemma (6.51 implies P{B^\Af,fn) < Q-<^4i+c5 ^^j. gome constants C4, C5 > 
depending on b, p. 
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By Proposition 13.91 for all large enough m the children of any G'12-orbit of form a 
Gi2-orbit of X^+i- Therefore assumptions (i)-(ii) hold for some random b and all m with 
probability 1. Also, we get that if is a word in the generators fixing v G X^, then for 
some {(71, (72}-"word w' all children of v are fixed by ww'. Therefore n{Xm+i) < i^{Xm-i), so 
K{Xm) < b" for some random b" and all m. 

Every random word in T{p) fixes only finitely many fixed vertices of T (Proposition^^, 
so (iii) holds for some random b and all m with probability 1. Thus if we define 

m>l 

then Ab is increasing in b and P{Ab) 1. 
Now we have 

P{Be n Ab) < PiBe n A^^b) < P(5,|A,,6) 

Since the latter is summable, we get that for fixed b, Bi fl Ab happens for at most finitely 
many i with probability 1. Taking the increasing union as 6 ^ 00 we get that B^ happens 
for at most finitely many i with probability 1. Then 

e I 

log,|G/r?| = ^.,> J]|Xo|/-^(i-p-«0 

> \Xo\ - e— ) = log, |r+/r?|(i - ) 

where the last inequality holds for large enough £ if c < ce \ogp. The claim of the proposition 
follows. □ 

We conclude this section with two conjectures of increasing strength. 

Conjecture 7.6. Let G he a closed subgroup ofT{p). Then G contains a topologically finitely 
generated subgroup of the same Hausdorff dimension. 

It may be that such subgroups are abundant. An affirmative answer to the following 
question would imply Coniectures 17.61 and 17.31 

Question 7.7. Let G be a closed subgroup ofr{p). Let gi,g2 be random elements chosen 
according to Haar measure on G. Is it always true that dimH((5'i, (72)) = dimH(G) with 
probability 1? 



8 Small subgroups 

In this section we state general results on small subgroups of T{p). 

First we give a description on the Abelian subgroups of T{p) using orbit trees. A rooted 
tree is a 1-p tree if all the vertices have 1 or p children. We call a vertex solo, if it has 1 
child. For an arbitrary rooted tree R let S{R) denote the number of solo vertices of R. We 
say that a group A C r„(p) belongs to if i? = Tn/A. 
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Lemma 8.1. Let R be a 1-p tree of depth n and let A C r„(p) an Abelian subgroup which 
belongs to R. Then log^ \ A\ < S{R). Equality holds if and only if A is a maximal Abelian 
subgroup which belongs to R. 

Proof. We use induction on n. For n = 1 the proposition is trivial. Suppose it holds 
for n — 1. 

Let B O Ahe the stabilizer of level 1 of T„. Then B has index 1 or p in A. Let Ai, . . . , Ap 
denote the subtrees of T„ starting from a vertex of level 1. Let Bi denote the restriction of 
the action of B to Aj and let \Bi\. 

li A = B (i.e., if the root of R is not solo), then A maps each Aj onto itself so we have 

p 

i=l 

Using induction we get 

p p 

J2k,<Y,S{A^/B,) = S{R). 

i=l i=l 

li \ A : B\ = p (i.e., the root is solo), let (7 G A \ 5 be an arbitrary element. Then G B, 
so the action of g on T^/B has order p. This means that g induces graph isomorphisms 
between the trees A^/Ei, so S{T^/B) = pS{Ki/B{) and S{R) = S{Ai/Bi) + 1. Now let 
6 G -B be any element which lies in the kernel of the restriction of 5 to Ai, i.e., fixes Ai 
pointwise. Since g permutes the Aj transitively and commutes with b, it means that b fixes 
all the Aj pointwise, that is, b is the identity. Thus B is isomorphic to Bi. So using induction 

If the equality \A\ = p^^^^ holds, then A is trivially maximal with the property R = Tn/A. 
The other direction follows by induction the same way as the inequality above. □ 

As a corollary, one can show that Abelian subgroups of T{p) are zero-dimensional. Theo- 
rem IHl gives a more general result in this direction. Moreover, Theorem El shows that Abelian 
groups are also small-dimensional as sections: the derived subgroup of a closed subgroup 
G C r(p) has the same dimension as G. 

For Theorem El we need a general asymptotic result on rooted trees. Let > 2 be an 
integer. A rooted tree T is defined to be a li-bounded tree if every vertex has at most d 
children. It is a 1-d tree if every vertex has either 1 or d children. Let U he a subset of a 
(i-bounded tree T. We define the density sequence of U by 

where rn{U) is the number of vertices in U of level n. We define the density 

6{U) = lim 6n{U) 

n— >oo 

if this limit exists. In particular, S{T) always exists, since 6n{T) is monotone decreasing. 
The following straightforward lemma will be useful. 
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Lemma 8.2. Let T be a d-hounded tree and let U he the set of vertices of degree less than d. 
Then Xli^o — ^i^ " ^(^))- ^'^ particular, S{U) = 0. // in addition T is a 1-d tree then 

oo , 
Y.^^iU) = J^{l-S{T)). 
i=0 

Proof. For the first claim, we may assume that all the vertices have degree d or d — 1. 
Then ri{U) = dri{T) — ri^i{T). From this, using ro(T) = 1 we have 



n n ^ 

i=0 i=0 ^ 



d' rf^+i / V c/"+i 



which gives YllZi^ii^) ~ ^(^ ~ ^i'^))- If every vertex has degree 1 or ci then we have 
fi{U) = {dri{T) — r.j4_i(T)) /{d — 1) and the last statement of the lemma follows. □ 

Now we are ready to prove the "perfectness" theorem, restated here. 

Theorem [5l Let G C r(p) be a closed subgroup. Then we have lim(7„(G') — 7n(G'')) = 0. 
In particular, dimn G = dimn G' . 

Proof. Let F = T{p)/G be the orbit tree of the action of G on T{p). We treat F and 
T{p) as 1-p trees. Fix positive integers d and k. 

We partition the vertices of F into three classes as follows. Let A be the set of orbits 
which have exactly p^ k-ih cousins in F including themselves. Let B be the set of orbits of 
size at least p'' and let C be the set of remaining vertices. 

We claim that 5{BUC) = 0. To see this, let us define a new graph F' on the vertices on 
F by putting an edge between u and u if m is the fc-th grandson of v. Then F' is the union 
of k disjoint p^-bounded trees. Now let D be the set of F'-parents of elements m. B U C. 
Then D is the set of vertices in F' which has less than p^ children, so by Lemma 18.21 D has 
zero density in each of the components of F' . But then B U C has zero density in F . 

For an arbitrary p-group P acting on a finite set X of size m let us define the commu- 
tator density by c(P) = logp \ P : P'\. We use the following facts on c(P). 



m. 



(a) Comparing P to the Sylow p-subgroup we have c(P) < logp |Sylp(Sym(m)) | < 

(b) If P is transitive on X then c(P) < Km/ \/l oKm where K is an absolute constant. 
This is a result of Kovacs and Newman ( 1988| ) . 

(c) If P is intransitive and X = [Jl^^ Xi such that each Xj is P-invariant, then we have 
c(P) < J2i=i c(-Pj) where Pj is the action of P on Xj. 

(d) If P acts diagonally on the Xi, i.e., when elements of P fixing any of the Xi pointwise 
must fix the whole X, then c(P) = c(Pi) . 
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Now let n be an arbitrary level and let A„, Bn and C„ be the set of vertices of T which 
belong to A, B and C respectively. Let G„ be the action of G on the n-th level of T and let 
Gq, Gfe and Gc be the action of G„ on A^, _B„ and 

The orbits lying in A can be partitioned into sets of size such that the action of Ga 
on them is diagonal. Then (d), (c) and (a) imply 

Using (b) and (c) we have 

c{Gb) < K \Bn\ /Vd < Kp^'/Vd, 
since all the orbits lying in B have size at least p'^. Lastly (a) implies 

c(Ge) < \G„\ < 5„(G)p"+'^-\ 

since all the orbits in G„ have size at most p'^^^. Combining the above, (c) on An, Bn and 
Gn implies 

c(G„)p-" < p-' + K/Vd + (5„(G)/-^ 

and letting n — > oo we get 

lim sup c(Gn)p-" < p-^ + K/Vd + 5(G)/-^ (19) 

71— »00 

where S{G) < S{B U G) =0. The inequality p9|) holds for every positive k and d, so 
c{Gn)p~^ — > 0, and the proof is complete. □ 

As an immediate corollary, solvable subgroups are zero-dimensional. Using a Baire cate- 
gory argument we obtain the following. 

Corollary 8.3. Let G C T{p) he a positive- dimensional subgroup. Then G cannot he ab- 
stractly generated by countably many solvable subgroups. 

Proof. Suppose that Hi, H2, . . . C G are solvable subgroups generating G as an abstract 
group. Since closure preserves solvability, we can assume that the Hi are closed. Then 

G = \^ Hni Hn2 ■ ■ ■ Hn^ 

k,nieN 

is the countable union of the finite products formed from the Hi. The sets Hn^Hn^ ■ ■ ■ Hn^. are 
closed, so by the Baire category argument one of them has to contain an open set U. Then U 
contains a coset of an open subgroup in G, so dimn Hn^Hn^ ■ ■ ■ Hn,, > dimn U = dimn G > 0. 
By Theorem El -ffrij are zero-dimensional, moreover, lim7„(if„.) = 0. It is easy to see that 



7i 



i{Hn^Hn2 ■ ■ -HnJ < ^Jn{Hn,] 



1=1 



for all n. This implies dimn Hn^Hn2 ■ ■ ■ Hn^ = 0, a contradiction. □ 
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Note that the full T{p) can be easily generated topologically by an element and an Abelian 
subgroup. 

Since solvable subgroups have dimension zero, we need to find a more refined way to 
measure how large they are. This can be done by summing the density sequence rather than 
taking its limit, as Theorem IHl shows. 

Theorem |H1 Let G ^ r{p) be a solvable subgroup with solvable length d. Then we have 
Yl°^=oln{G) < Cd where C is a constant depending on p only. 

The example r„ C T{p) shows that this is the best possible bound up to the constant 
factor. 

Proof. We can directly deduce this theorem from Lemma l8.ll and Lemma 18.21 in the 
case when G is Abelian. 

Instead of 7„(G), it is more convenient to first estimate a slightly different density. Let 

ln{G) = {p- 1)^^^, Sn{G) = j^^AG). 

^ 1=0 

Let = max{s„(if) | H C r„(p) with solvable length d}. 

Let Ai, . . . , Ap denote the subtrees of r„ starting from level 1. Let K (1 H denote the 
stabilizer of level 1. Either K = H or \H : K\ = p. Let Kj denote the action of K on Aj. 
The groups Kj are d-solvable so Sn-i{Kj) < m„_i_rf for all j. 

li H = K then we have 

p 

Sn{H) < Sn-l{K,)/p < mn-l,d. (20) 

i=i 

li \H : K\ = p then let A denote the restriction of K to the set A2 U . . . U A„ and let N (1 Ki 
be the kernel of this restriction, i.e., the set of elements which act trivially outside Ai. Let 
h G H\K be an element moving the first level and let D = [N, h] . Then h moves all the 
vertices of the first level so the restriction of D to Ai is again A^. Hence from D (1 G' we see 
that N is d — 1-solvable. Now \H\ = p \N\ \A\ and \A\ < \K2\ ■ ■ ■ \Kp\ so we have 



from which we get 



i=2 



p 1 

Sn{H) < + - {mn-14-i + {p- l)mn-14) . (21) 

p — I p 

Summarizing, we have rrin.o = and a recursive upper bound on rrin^d given by the 
maximum of the right hand side of (j^UI) and ()21|) . From this the bound m„_d ^ dp'^/ {p — 1) 
easily follows. On the other hand, trivially |7n(G') — 7„(G')| < 1/p"' which yields 



J2^niG)<Cd 



n=0 



with C = (p2 _ 1). □ 
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A possible generalization of the result on the zero-dimensionality of solvable groups would 
be that every subgroup of r(p) which satisfies a nontrivial identity is zero- dimensional. Note 
that Conjecture |H1 would imply this even for pro-p identities. Another possible direction 
arises from the observation that linear groups tend to be zero- dimensional in T{p). 

Conjecture 8.4. Let R be a commutative pro-p ring and let G C GLn{R) be a linear group 
over R. Then for every embedding of G into T{p) the image of G has zero dimension. 

It is easy to see that the conjecture holds for R = Z^: in this case G is jo- adic analytic and 



so is the product of finitely many procyclic subgroups (IPixon et al.1 (Il99lh. This question 
is also related to Conjecture |H1 in the sense that by Barnea and Larsen (|l999[ ) nonabelian 
free pro-p groups are not linear over local fields, so Conjecture |H1 implies Conjecture 
the case when i? is a local field. 



m 



9 Large subgroups 

In this section we analyze 1-dimensional subgroups of T{p). The main results are that 
spherically transitive 1-dimensional groups have normal spectra {0, 1} and that they contain 
nonabelian free pro-p subgroups as well as dense free subgroups. Recall that the normal 
spectra of a group G is the set of possible Hausdorff dimensions of normal subgroups of G. 

One of the natural sources of interesting subgroups in T{p) are just infinite pro-p groups. 
A group G is just infinite if every proper quotient of G is finite; G is hereditarily just infinite 
if all subgroups of G of finite index are just infinite. Just infinite pro-p groups are regarded 
as the simple groups in the pro-p category. Many questions on pro-p groups can be reduced 
to the just infinite case, since (unhke in the profinite setting .Zalesskii f^2fl02l )) every finitely 
generated pro-p group poss esses iu s t infin i te pro-p quotients . 

Building on the work of iGrigorchukI showed that just infinite pro-p 



groups fall into the following two classes: 

(i) groups containing an open normal subgroup which is the direct power of a hereditarily 
just infinite pro-p group; 

(ii) pro-p branch groups. 

Groups from class (ii) have a natural action on T(p). In fact, one can define them as 
spherically transitive subgroups G C r(p) such that the rigid level stabilizers of G have 
finite index in G. The first example for such a group was the clos ure of the first Grigorchuk 



group, a subgroup of r(2) with many remarkable properties. iBartholdi and Grigorchuk 
( 2000[ ) calculated that i t has Hausdorff dimension 5/8 



In Idu Sautov et all ^2QQ(h . Boston suggested a direct connection between Grigorchuk's 



classes and Hausdorff dimension: he conjectured that a just infinite pro-p group is branch if 
and only if it can be obtained as a positive dimensional subgroup of r(p). In this section we 
contrast known properties of branch groups with new results on 1-dimensional groups. The 
first result concerns the normal subgroup structure of such groups. 

In general, the normal spectra of 1-dimensional groups can easily be the full interval [0, 1]. 
The simplest example for this is the stabilizer of an infinite ray in r(p); it is the full countable 
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direct power of T{p) with weighted dimensions However, if we assume that the group 

is spherically transitive, i.e., it acts transitively on every level, we get a com pletely different 



pictur e. It turns out that these groups, just as the free pro-p group (see iNewman et al. 
are "simple" in terms of Hausdorff dimension. 



Theorem 9.1. Let G C r{p) be a spherically transitive 1- dimensional closed subgroup. Then 
for every 1 ^ N < G we have dimn = 1. 

We need some technical lemmas. The first one is an asymptotic version of Theorem [7| 
As the existence of large Abelian subgroups in r„(p) shows, we have to assume that the 
density is close to 1. 

Lemma 9.2. There exist a constant C depending only on p such that for every G C r„(p) 
with 7ri,(G') > 1 — e we have 7n(G") > 1 — e — S where 5 = C min{n, — logp e}~^^'^. 

Proof. We consider the action of G on level n of T„. Let be the number of orbits of 
size j9* (0 < i < n), and let r denote the total number of orbits. Then 



n 



^ry=p" (22) 

and 

n n 

logp|G| < ^ri\ogp\Ti{p) \ = ^ 



Ti = — . 



i=0 i=0 ^ ^ 



From this and the density assumption 



logJG|>(l-e)^" 



=P ^ ' p-l 

we get the inequality r < e(p" — 1) + L Using the theorem of Kovacs and Newman ((l98^ 
with (b) and (c) from the proof of Theorem El we see that 

n ,• 
P 



\og^\G:G'\<KY,n-^ 



where K is an absolute constant. Separating the sum at an arbitrary m < n and using 

we get 



, /I ~ , frn 



j=0 



Therefore with mo = min {n, — logp e} and C = AKij) — 1) we get 



o _|_ m 

log, \G : G'\ < K{p- - l)—^ < Clogp |r„(p)|/V^, 



and this implies the stated bound on the density 7„(G'). □ 
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Consider a subgroup G C r„(p) where n may be infinite. We define the rigid stabilizer 
TZkG of level k as follows. Recall that the rigid vertex stabilizer TZ^G of a vertex v is the 
pointwise stabilizer of the vertices not descendant to v. Then TZ^G is defined as the group 
generated by the TZ^G of vertices v at level k. 

We need a slight reformulation of Theorem 4. of iGrigorchuk This is the key 



theorem to prove that the first Grigorchuk group, or more generally, torsion branch groups 
are just infinite. 

Lemma 9.3. Let G C r„(p) he transitive and let g E G he an element which moves a vertex 
at level k. Then the normal suhgroup generated hy g in G contains (TZk+iG)' , the derived 
suhgroup of the rigid stahilizer of level k + 1. 



The proof is exactly the same as in Grigorcliul] ()2000') 



We also need a lemma which measures how close subdirect products of high density are 
to direct products in size. 

Lemma 9.4. Let G C Tn{pY satisfying \G\ > IF^I'^"^. Then G contains a direct product 
H = Hix ■■■ X Hd with \H\ > |r„|'^(i-^). 

Proof. Let z < d be a coordinate. Project G to the coordinates not equal to i and let Hi 
be the kernel of this projection. Then the image has size at most |r„|'^~^ so \Hi\ > |r„|^~^. 
Since the full direct product H = Hi x ■ ■ ■ x H^ lies in G, the statement of the lemma 
follows. □ 



Now we are ready to prove Theorem El restated here. 

Theorem 13 Let G C T{p) he a spherically transitive 1-dimensional closed suhgroup. Then 
every nontrivial normal suhgroup of G is 1-dimensional. 

Proof. Let < G be a nontrivial normal subgroup, and fix k so that moves a 
vertex at level k - I. ¥oi n > k let G„ = Gr("Vr^"\ let Nn = Arr(")/r("). Then is a 
nontrivial normal subgroup of G„ C r„. Let K C Gn denote the stabilizer of level k. Then 
\Gn '■ K\ < iTkl SO hj the 1-dimensionality of G 

iog|/r| =iog|r„|(i-e„) 

where e„ — > as ^ oo. Now K acts on the subtrees starting at level k so it is naturally 
a subgroup of Tn-kipY ■ Lemma lOl implies that K contains a large direct product; in 
particular, the rigid stabilizer TZk = TZkG satisfies 

log|7^fe| >/iog|r„_fc|(i-/en) >iog|r„|(i-e;) 

with 0. By Lemma 021 the commutator R'l^ satisfies 

log |Ar„| > log |7^'J > log |r„| (1 - e; - 6n) 
where 5„ = C min{n, — log^ e'}^^/^. Letting n oo we get 'jn{N) 1, as required. □ 
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We are ready to prove Conjecture |H1 in the 1- dimensional spherically transitive case. 

Theorem ini Let G C V{jp) he a spherically transitive 1- dimensional closed subgroup. Then 
G contains a nonabelian free pro-p subgroup. 

In particular, G involves every finite p-group. Recall that a group G involves a group H 
if it can be obtained as a quotient of a finite index subgroup of G. Our first lemma confirms 
this for every 1-dimensional subgroup. 

Lemma 9.5. Let G C r{p) be a closed subgroup of dimension 1. Then G involves every 
finite p-group. 

Proof. It suffices to prove that G involves r„ for every n. We define the n-sample S^j 
of G at the vertex v of depth k as follows. Take the stabilizer of level k. Project this group 
onto the descendant subtree of v and cut it at level k + n. Let C P^ denote the action on 
this subtree. 

Now for any i it is easy to see that Gni = GT^^^^ satisfies 



\Gn£\ < Y\ 1*^^ 



log 


|*S't,| 


log 


|r„| 



where the product is taken over all vertices in T„£ at levels divisible by n. Then 

7ne{G) < < max 

El, log I rn I 

As £ oo, the left hand side converges to 1, so there exists a v such that 5*^, = P„. It is 
easy to see that G involves all its samples. □ 

As a corollary, 1-dimensional groups cannot satisfy any pro-p identity. It seems plausible 
that the lemma holds for arbitrary positive-dimensional closed subgroups. 

Let G C P(p) be a closed subgroup and let r be an infinite ray with vertex r„ at level 
n. There are two possible notions of the nth stabilizer of r in G: the stabilizer subgroup 
Fn^Gn = GP("Vr^"^ of and the congruence quotient = i\:p('^)/p(") of the stabilizer 
K C G of r. In general, Kn C Fn but they need not coincide. However, equality holds for 
spherically transitive groups of high enough dimension. 

Lemma 9.6. Let G C P(p) be a spherically transitive closed subgroup with dimn G > 1/p 
and let r be an infinite ray. Then there exists uq, such that for all n > uq we have Kn = Fn. 
In particular, dimn K = dimn G . 

Proof. Let Vn = {G n P„)P„+i/P„+i, the level n subspace of G. Then we have 
dim^n < p"' and logp|G„| = dimVo + ... + diml^_i. Our assumption and a straight- 
forward computation implies that there exists no such that for all n > uq the subspace Vn is 
nontrivial. 

Now let n > Uq and let G F„. We have to show that gn can be extended to K, that 
is, there exists g E K so that g/T^'^'^ = gn- In fact, using induction, it suffices to show that 
there exists an extension gn+i to Fn+i, the stabilizer of the vertex w of r at level n -t- 1. 
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Let 5* be the set of possible extensions of g to Gn+i Then S forms a coset of Vn in 
Gn+i- Now G is spherically transitive and is a nontrivial G„-invariant subspace, thus the 
projection of Vn to the coordinate v E r is the entire Fp. Since 5* stabilizes v, the projection 
of S' to f also equals Wp. So there exist Qn+i G 5* which stabilizes the children oi v, as 
required. 

For n > riQ we have |G„ : Kn\ = \Gn '■ Fn\ = p", so dimn K = dimn G. □ 

Proof of Theorem [21 Let r be the leftmost infinite ray, that is, the set of vertices 
indexed by sequences of zeros. Let denote the stabilizer of r in the finite group T/^lp). 
Let K be the stabilizer of r in G. We claim that K has a closed normal subgroup such that 
the quotient group is isomorphic to the full direct product D = Ylk^k- 

Choose no according to Lemma 19.61 From Lemma 19.51 it follows that for every k there 
exists infinitely many vertex v such that the fc-sample 5*^, = F^. Since G is spherically 
transitive, samples at the same level are isomorphic, so we can choose a strictly increasing 
sequence such that S*^^ = Fj^ where the vertex is at level on the ray r. We can also 
assume that nk+i — nk> k for all k. Now K naturally projects into by the following 

function tt. For g E K let the k-th coordinate 7rk{g) be the action of g on the subtree of 
length k starting at the vertex Vk € r. As g stabilizes r, the image T^kig) will lie in Fk. 

We show that the projection vr is surjective, that is, for every h G Ylk we can find 
g K such that i^ig) = h. We construct g hj a series of level extensions. From level Uk 
to level Uk + k we use the following argument. Since g stabilizes Vk, the set of possible 
extensions of g under Vk is the union of cosets of the sample Sy^ (G) . But Uk was chosen in a 
way such that the fc-sample Svf,{G) is the full Tk{p). This implies that up to level nk + k we 
can prescribe g in an arbitrary way, so we can assume TTk{g) = hk- For levels between Uk + k 
and rifc+i we use Lemma f9. 61 to obtain an extension which stabilizes Vk+i- Since G is closed, 
our series of extensions produces an element of G, so the claim holds and K projects onto 
D. 

Since Vk-iij)) is a subgroup of Fk for all /c, every finite p-group can be embedded into 
Fk for large enough k. This implies that the full product D has a nonabelian free pro-p 
subgroup topologically generated, say, by the elements x, y. Then any two preimages x, y 
in G generate a nonabelian free pro-p subgroup, since pro-p words satisfied by x and y are 
also satisfied by x and y. The proof is complete. □ 



IWilson ( 2nnfl( ) showed that just infinite pro-p branch groups co ntain dense free subgroups 



on the existence of dense free subgro ups in profinite groups see iPvber and ShalevI 1)200 ih 



and ISoifer and Venkataramana ()200 0h). The same holds for 1-dimensional subgroups. Let 



d{G) denote the minimal number of topological generators for G (this might be infinite). An 
abstract subgroup of a topological group G is dense if its closure equals G. 

Theorem IIOL Let G C F(p) he a closed subgroup of dimension 1 and let k > d{G). Then 
G contains a dense free subgroup of rank k. 

In other words, the free group F^ is residually S where S denotes the set of congruence 
quotients of G. 



Proof. By Corollarv 14.61 the subgroup generated by k random elements is free of rank 
k with probability 1. 
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If A; = oo, then even the set of k random elements is dense in G with probabihty 1, since 
G CT{p) has a countable base. If d{G) < k < oo then topological generation depends only 
on the Frattini quotient Cp^'^^ of G. This is generated by A; > d{G) random elements with 
positive probability. □ 
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